Online Social Network Profile Linkage

Authors

  • Haochen Zhang
  • Min-Yen Kan
  • Yiqun Liu
  • Shaoping Ma
Abstract

Piecing together social signals from people in different online social networks is key for downstream analytics. However, users may have different usernames in different social networks, making the linkage task difficult. We explore a probabilistic approach that uses domain-specific prior knowledge to address this problem of online social network user profile linkage. At scale, linkage approaches based on naïve pairwise comparison, which has quadratic complexity, become prohibitively expensive. Our proposed threshold-based canopying framework, named OPL, reduces these pairwise comparisons and guarantees a theoretical upper bound of linear complexity with respect to the dataset size. We evaluate our approaches on real-world, large-scale datasets obtained from Twitter and LinkedIn. Our probabilistic classifier, which integrates prior knowledge into Naïve Bayes, achieves an F1-measure of over 85% for pairwise linkage, comparable to state-of-the-art approaches.
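To make the idea of threshold-based canopying concrete, the following is a minimal sketch of generic canopy blocking over usernames. It is not the OPL framework itself, whose details the abstract does not give, and it does not reproduce OPL's linear complexity bound. The function names, the character-bigram Jaccard similarity, the two thresholds (loose, tight) and the sample usernames are all assumptions chosen for illustration.

```python
from itertools import combinations


def bigrams(name: str) -> frozenset:
    """Character bigrams of a lowercased username (a cheap similarity key)."""
    s = name.lower()
    return frozenset(s[i:i + 2] for i in range(len(s) - 1)) or frozenset({s})


def jaccard(a: frozenset, b: frozenset) -> float:
    """Jaccard similarity between two bigram sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def canopies(names, loose=0.3, tight=0.6):
    """Generic canopy clustering (illustrative thresholds, not from the paper).

    A name joins the current canopy if its similarity to the center is at
    least `loose`; it stops being a future center candidate once the
    similarity reaches `tight`.
    """
    remaining = list(names)
    result = []
    while remaining:
        center = remaining.pop(0)
        center_key = bigrams(center)
        canopy, keep = [center], []
        for other in remaining:
            sim = jaccard(center_key, bigrams(other))
            if sim >= loose:
                canopy.append(other)
            if sim < tight:
                keep.append(other)
        remaining = keep
        result.append(canopy)
    return result


def candidate_pairs(names, loose=0.3, tight=0.6):
    """Only pairs that share a canopy are passed on to the expensive classifier."""
    pairs = set()
    for canopy in canopies(names, loose, tight):
        pairs.update(combinations(sorted(canopy), 2))
    return pairs


if __name__ == "__main__":
    # Hypothetical usernames, for illustration only.
    names = ["alice_smith", "alicesmith", "bob.jones", "bobjones99", "carol"]
    for pair in sorted(candidate_pairs(names)):
        print(pair)
```

Only pairs that fall into a common canopy would be scored by a pairwise classifier such as the Naïve Bayes model mentioned above; every other pair is skipped without a detailed comparison.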

Similar resources

Online Social Network Profile Linkage Based on Cost-Sensitive Feature Acquisition

Billions of people spend their virtual life time on hundreds of social networking sites for different social needs. Each social footprint of a person in a particular social networking site reflects some special aspects of himself. To adequately investigate a user’s preference for applications such as recommendation and executive search, we need to connect up all these aspects to generate a comp...

A Hybrid Model for Linking Multiple Social Identities Across Heterogeneous Online Social Networks

Automated online profiling consists of the accurate identification and linking of multiple online identities across heterogeneous online social networks that correspond to the same entity in the physical world. The paper proposes a hybrid profile correlation model which relies on a diversity of techniques from different application domains, such as record linkage and data integration, image and...

Discovery and Protection of Sensitive Linkage Information for Online Social Networks Services

This paper investigates the problem of suppressing access to sensitive linkage information over data published by users of an online social network service. We unveil the potential threats by inferring linkage information from the user-published data, and suggest a class of data publishing schemes to enable distributed data publication by individual users but hide the sensitive information. Our...

Social Network Data Analytics

The advent of online social networks has been one of the most exciting events in this decade. Many popular online social networks such as Twitter, LinkedIn, and Facebook have become increasingly popular. In addition, a number of multimedia networks such as Flickr have also seen an increasing level of popularity in recent years. Many such social networks are extremely rich in content, and they t...

European Journal of Open, Distance and E-Learning

The most productive learning experience for students, whether online or in face-to-face classes, can often be the interaction among students and with an instructor. Online teaching and Social Network Analysis (SNA) offer the opportunity to examine intellectual social networking and strategies that promote student interaction, which can enhance learning. This study focuses on two online courses in...

Publication date: 2014